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Abstract 

We consider Gibbs distributions on finite random plane trees with 
bounded branching. We show that as the order of the tree grows to 
infinity, the distribution of any finite neighborhood of the root of the tree 
converges to a fimit. We compute the fimiting distribution expficitly and 
study its properties. We introduce an infinite random tree consistent with 
these limiting distributions and show that it satisfies a certain form of the 
Markov property. We also study the growth of this tree and prove several 
limit theorems including a diffusion approximation. 

1 Introduction 

Various kinds of random trees have been studied in the literature. In 
this note we consider simply generated random (plane rooted) trees also 
known as branching processes conditioned on the total population (CBP), 
see [AldQlb) . Our initial motivation was a study of the secondary struc- 
ture statistics for large RNA molecules, see [BH08] and |BH09) . The 
secondary RNA structures can be encoded via plane rooted trees and 
studied with the help of energy models. In [BH08| and |BH09) . it is 
demonstrated that the naive energy minimization approach to the predic- 
tion of typical secondary structure features fails to explain the presence 
of high degree branchings. However, using the language of statistical me- 
chanics and working with Gibbs ensembles on trees, we were able to in- 
clude the entropy correction and recover the typical RNA branching type. 
These results are concerned only with the rough information related to the 
branching statistics, but in this paper, we suggest a new viewpoint that 
helps to obtain some insights into the geometry of large random trees. 

The model we work with follows the classical Boltzmann-Gibbs pos- 
tulate stating that the probablity of a configuration T is proportional to 
g-/3B(T)^ where E{T) is the energy of T, and /3 is the inverse temperature 
in appropriate units (see the complete description of our model in Sec- 
tion[21). Gibbs distributions, especially their limiting behaviour under the 
limit of the size of the system tending to infinity (so called thermodynamic 
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limit), are central to statistical mechanics, see [Sin82) and [Geo88] for a 
modern mathematical introduction. 

The first goal of this paper is to prove that as the order of the tree 
grows to infinity, the distribution induced by the Gibbs measure con- 
verges to that of an infinite discrete tree that we explicitly describe in 
detail (Sections [2] to [SJ . This thermodynamic limit belongs to the cate- 
gory of discrete limits of GBP according to the terminology introduced 
in [Ald9f b) . and our result (as well as the limiting object) appears to be 
new. In particular, it does not involve any rerooting procedures like the 
one introduced in [AldQla] . We prove the result above for the bounded 
branching (or out-degree) case, although it should hold true under less 
restrictive assumptions. 

The limiting infinite discrete tree is a more sophisticated object than 
a classical Galton-Watson tree. In particular, it dies out with zero proba- 
bility and the progenies of distinct vertices are not independent. However, 
it turns out that the limiting tree is Markov in a natural sense, and the 
Markov transition probability is explicitly computed in Section [S] In Sec- 
tion [6] we notice that the number of vertices at a given distance n from 
the root also form a Markov chain if n is understood as a time param- 
eter. We prove that under linear scaling this Markov chain satisfies a 
limit theorem with the limit given by a gamma distribution. In Section [7] 
we strengthen this result and show that a functional limit theorem holds 
with weak convergence to a diffusion process on the positive semi-line with 
constant drift and diffusion proportional to the square root of the space 
coordinate. Since this process (under the name of local time for Bessel(3) 
process) also serves as a scaling limit of the "height profile" for GBP it- 
self, see |Ald9f bl Gonjecture 7] and [Git98) . we can say that the infinite 
Markov random tree that we construct belongs to the same universality 
class as the original GBP. 

There are several natural and interesting problems arising in connec- 
tion with our results. One is, obviously, strengthening them to give an 
alternative to [Git98) proof of the scaling limit in Aldous's Conjecture 7. 
Another one is to use our approach to study finer details of the random 
tree rather than the height profile. Our heuristic computation (see Sec- 
tion |S} shows that the limit can be described as a solution of an SPDE 
with respect to a Brownian sheet. 

Acknowledgements. The author is grateful to NSF for partial sup- 
port of this research via GAREER award DMS-0742424. He also thanks 
the referees for their useful comments. 

2 The setting and first results on ther- 
modynamic hmit 

Let us recall that plane trees (or, ordered trees) are rooted trees such that 
subtrees at any vertex are linearly ordered. In other words, two plane 
trees and are considered equal if there is a bijection between the vertices 
of the two trees such that it preserves the parent — child relation on 
the vertices and preserves the order of the child subtrees of any vertex. 
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Figure 1: Five different plane trees of order 4. 

Figure [l] shows all plane trees on 4 vertices. 

We fix D G N and introduce Tjv = Tjv(_D), the set of all plane trees 
on N vertices such that the branching number (i.e. the number of children, 
or out-degree) of each vertex does not exceed D. To introduce a Gibbs 
distribution on Tjv, we have to assign an energy value to each tree. We 
assume that an energy value _Ei G R is assigned to every i £ {0, . . . , D}, 
and the energy of the tree T is defined via 

D 

veV{T) i=0 

where V{T) denotes the set of vertices of the tree T, deg(?;) denotes the 
branching number of vertex v, and Xi{T) is the number of vertices of 
branching i in T. Since the energy of an individual vertex depends only 
on its immediate neigborhood via the branching number, one can say that 
this a model with nearest neighbor interaction. 

Now we fix an inverse temperature parameter /3 G R (usually, in sta- 
tistical physics /? > 0, but our results apply to other values of /? as well) 
and define a probability measure /ijv on Tjv by 

g-/3E(T) 

Miv{r} = — y — , 

where the normalizing factor (partition function) is defined by 

TgTjv 

In particular, if /3 = or, equivalently, Ei = for all i, then /ijv is a 
uniform distribution on Tjv. 

First, we are going to demonstrate that the above model admits a 
thermodynamic limit, i.e. the sequence of measures (/^]v)]veN has a limit in 
a certain sense as — s- oo. Secondly, we study several curious properties 
of the limiting infinite random trees. 

For each vertex w of a tree T G Tjv its height h{v) is defined as the 
distance to the root of T, i.e. the length of the shortest path connecting v 
to the root along the edges of T. The height of a finite tree is the maximum 
height of its vertices. 

Let n, A'^ G N. For any plane tree T G Tat, Hn,NT denotes the neighbor- 
hood of the root of radius n, i.e. the subtree of T spanned by all vertices 
with height not exceeding n. 
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For any n and sufficiently large N, the map 7r„,jv pushes the mea- 
sure UN on Tjv forward to the measure MivTr"^ on Sn, the set of all trees 
with height n. 

Theorem 1 For each G N, the measures /iivTr^jv ■S'n converge in 
total variation, as N oo, to a measure P„. 

A proof of this theorem will be given in Section[3l At this point we prefer 
to introduce more definitions that will allow us to describe the limiting 
measures P„. 
We define 

( D D 

A = lp= (po, . . . ,Pd) G [0, 1]^+^ : ^p,: = f, ^ip, = 1 

I i=0 i = 

and let 

J{p)^-H{p) + ^E{p), PGA. 

where 

D 

H{p) ^ - "^Pi In Pi 
is the entropy of the probability vector p G A, and 

D 

E{p)^Y.P^E^ 

is the associated energy. 

The function J is used to construct the rate function in the Large 
Deviation Principle for large plane trees, see p3H08 , BH09 . 

It is strictly convex and its minimum value on A is attained at a unique 
point p*. Using Lagrange's method, we find that 

Inp* + 1 + l3Ei + \i+iX2=0, j = 0, 1, . . . , D, 

where Ai and A2 are the Lagrange multipliers. So we see that 

p* =C7e-''^'p\ i = 0,l,...,A (1) 

where C = e"^""^^, and p — e~^^. In particular, 

p*>0, i = 0,l,...,D. (2) 

Notice that p can be characterized as a unique solution of 

D D 

2_^e ^ 'p = 2^Je ^ 'p , 

i=0 i=0 

and C may be defined via 

^ D D 

- = ^eP^p=^ieP^p. (3) 

1=0 1=0 
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We denote J* — J{p*) and a = . For a tree t £ Sn, we introduce 

E{r)= J2 ^dcg(.). (4) 

h(v)Kn 

Notice that the summation above excludes the highest level of the tree. 

Theorem 2 For any n G N, the limiting probability measure Pn is given 
by 

Pn{T} = Q„fcp'=a'"e-''^'^) (5) 

where the tree r £ Sn is assumed to have k vertices of height n and m 
vertices of height less than n. The constant Qn is a normalizing factor. 

We give a proof of Theorems [T] and [5] in the next Section O In Section [3] 
we compute the value of Qn explicitly. In Section [5] we shall see that 
our convergence results may be interpreted as convergence to an infinite 
random tree. 

Remark 1 Although Theorems\l\ and\^ do not hold in full generality for 
D = 00, we expect that there is a large class of energy functions for which 
analogous results are true. 



3 Proof of Theorems [T] and [2] 

For both theorems it is sufficient to check that for any n and any two trees 
n, T2 G S„, 

where we assume that ri has ki vertices of height n, and mi vertices of 
height less than n; T2 has k2 vertices of height n, and m2 vertices of height 
less than n. 

The energy of each tree T with iin.NT — ti is composed of contribu- 
tions from the vertices of the tree ri of height less than n (we call this 
contribution E{ti), see Q) and the contribution from the plane forest on 
N — mi vertices with ki connected components. The same applies to T2 . 

Let us recall (see e.g. Theorem 5.3.10 in |Sta99| ) that the number of 
plane forests on A*" vertices with k components and ro, ri, . . . , ro vertices 
with branching numbers, respectively, 0, 1, . . . , D is 

/ iV 
A'' I ro, n, . . . , ro 

if ro + . . . + ro = N , ri + 2r2 + . . . + Dro — N — k, and otherwise. 
Therefore, 

P-'V''„,Jvin J- _ t= Z^reA(JV,7ni,fei) JV-mi \ro, ri , ro^ 

_ e-^^'-^'Ji(jV) 
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Here 



A{N,m,k) = {r eZ^ 



ro + . . . + m = N - m, 

ri+2r2 + ... + Dm ^ N - m - k}, 



and Z+ = N U {0}. 

Fix any e > and define 



A(iV, m,k,e) = -{r e A{N, m, k) 
We claim that 



< £ 



h{N)^h{N,e){l + o{l)), N 



(8) 



where 



h{N,e) 



E 



ki 



N - 



N - 



mi 



mi \ ro, n, 



-liE(r) 



rSA(JV,mi ,fci ,e) 

In fact, using Stirling's formula we see that if 7^ for all i = 0, 



N -mi 
N - mi \ro, ri, . . . , m 



ki{N ~mi)' 



2 e 



-l3E{r) 



-PE{r) I2(iv-mi) ' 



(2^)- 



kie 



-(N-mi)J(-, 



((A'' — mi)ro . . . ro) 2 



(27r) 



with < < 1 for all j G N. If A'^ is sufficiently large, there is a vector 
r*{N) G A(iV, mi,fci,e) such that j < e/2. Due to the strong 

convexity of J, there is a number 5 > independent of A*' such that 



min J 

A(W,mi,fei)\A{Af,mi,fci,£) 



N -mi 



> J 



r*{N) 
N -mi 



+ 5, 



so that the contribution from each element of A{N, mi , ki)\A{N, mi, ki,e) 
is exponentially smaller than that of r*{N) as N 00. The statement 
follows since the number of elements in A(N, mi, ki) \ A{N, mi, ki, e) is 
bounded by A''^^^. This argument can be easily extended to the case 
where ri = for some i, which completes the proof of our claim 
Let us now define for r G A(A', mi, fci, e), 

b(r) = (ro + (fc2 - fci), n - (fc2 - ki) - (m2 - mi), r2, rs, . . . , ro). 

Notice that for sufficiently small e and sufficiently large A', the image 
A'{N, e) of A{N, mi, ki, e) under 6 is a subset of A(A", m2, fe). Moreover, 
b is invertible and, therefore, establishes a bijection between A(A^, mi, fei, e) 
and A'(A',£). 
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Introducing 

and using exactly the same reasoning as for Ii, we see that 

/2(iV) = /2(iV,e)(l + o(l)), N^oo. (9) 
Equations Q , (O , (O imply now that 



(1 + 0(1)) 

(l + o(l)), iV->oo, 



(10) 



where 



and 



N -mi 
ai,r =1 I e 

ro, ri, . . . , ro 



/3i5(r) 



a2r=( N-m2 \-PE(b(r))^ 

yro + (fc2 - fci), n - (fc2 - fci + m2 - mi),r2, . . . , roy 

Assuming that fci > fe2 and mi > m2 (all the other cases can be 
treated in the same way), we get 

Ol.r _ ((ri - (fc2 - fci) - (7712 - mi)) . . . (ri + 1) ^ 

a^r ^ (iV - m2) . . . (iV - mi + 1)) ■ (ro . . . (ro + (fc2 - fci) + 1)) ' 

where 

i? = i?(fci,mi,fc2,m2) = e''^''''-^i'^*=-'i^-''''i''"=-"'i^ 
Due to the definition of A(7V, m, fc, e), 

ai.r- ^ ((pt + e){N - mi) - (fca - fci) - (m2 - 777i))-"'^-''i'-('"^-'"^) ^ 
a2,T ~ (A^ - mi)-(™2-mi)((p. _ £)(-7v- - rrai) + (fe - fci))-('=2-'=i) ' 

so that 

limsup sup ^ < (p*+e)-('"2— i) /'pI±£'\ "'^^ 

N^oa reA(N,m,k,i:) 0-2,r \Po ^ ^ / 



Pi+eJ V Pi+e 

(11) 

In the same way, 
hminf inf ^ > f ^I^^i) " f + -)-^'"-"' ) 

(12) 



N-nx rgA(]V,m,fc,e) 02, r \Pl ~ £/ V Pi ~ £ 
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Since the choice of e is arbitrary, relations (|10|) . (|1H) . and (|12p imply 
that 

iv"?^ lJ-Nn-\,{T2} ^ k2e-^E(r2) [ pi ) V Pi / 

(13) 

Using (m, we see that 

= -. (14) 

Pi P 

A direct computation based on ([T]) and ^ implies 
H{p*) = -\n{Cp) + l3E{p'). 

Therefore, 



e-'^"'^ = i. (15) 



Pt Cp a 
Now, ((6| is an immediate consequence of (|13|I . (I14|I . and p5|) . □ 



4 Consistency and the precise value of Q„ 

We begin with the following consistency property: 

Theorem 3 The family of measures {Pn)n&i is consistent, i.e. for any 
n and any t G Sn 

T'es„+i 

where tt^^^ denotes the projection map from Sn+i to Sn. 

Proof: This theorem is a direct consequence of the limiting procedure in 
Theorem [T] However, it is interesting to derive it from the specific form 
of P„ provided by Theorem [S] 

Let us assume that r G Sn, and r has n vertices of height k and m of 
height less than n. 

= Q„+i J2 e-^f^^-^+'^'^ + -+''^>^\i^ + ... + i,)p'- + -+^''a"^+'' 
= Q„+ie-''^Ma'"+'= J2 e-«^^ + -+^^'(ii + ...+4)p'^ + -+'^ 

il H-=0 

^ -0E(t) m + kj ^ f ^\ 
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In this calculation we denoted hy ii, ... ,ik the branching numbers of the 
vertices of height n. We used the definition of P„ in the first identity. The 
second identity is just a convenient rearrangement. The third one follows 
from the symmetry in the factor {ii + . . . + ik) . In the last identity we 
used and the fact that p* € A. Identity ([T3|l implies 



SO that 



ET-> r /T —3E(t) m + fc 7 —k k Qn + 1 7-, r t /i 't\ 

Pn-\-i{r \ = Qn^ie ^ ^ V ka p =— — Pn.{T}. (17) 



Since this holds true for all r £ S„, we can conclude that Q„ = Q„+i 
which completes the proof. □ 

Identity (|17|l means that the constant Q = Q„ in Theorem [5] is the 
same for all n. Choosing n = 1 we can compute it using ([Sj: 



A more precise version of Theorem [2] easily follows: 

Theorem 4 Let C he defined by ((3)). For each n, the limiting probability 
measure Pn is given by 

P„{t} = Cfee-'^^("V''^™"\ 

where the tree t £ S„ is assumed to have k vertices of height n and m 
vertices of height less than n. 



5 The limiting random tree 

Let Soo be the set of infinite plane trees with branching number bounded 
by D. Theorem [3] along with the classical Daniell — Kolmogorov Consis- 
tency theorem (see [Bil99) ) allows us to introduce a measure Poo on 5*00 
consistent with measures Pn for all n. Intuitively this is clear, but to make 
it precise we need to introduce a coding of plane trees. We have chosen 
one of several possible coding schemes. 

Let T be a plane tree (finite or infinite) with branching bounded by 
D. Then T has a finite number r„ < of vertices of any given height 
n. Let us say that all vertices of the same height n form the n-th level 
of the tree. The vertices of n-th level are naturally ordered and can be 
enumerated by numbers from 1 to r„ (except for the case when there are 
no vertices at n-th level at all). Each of r-„ vertices of the n-th level has 
a parent at the level n — 1. Denote the number received by the parent of 
Z-th vertex of the n-th level under the described enumeration by g„ j. If 
r„ < I < -D" we set g^^i = 0. We also agree that for the root of the tree, 
i.e., the first vertex in the zeroth level, go,i = 1. 
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Then for any n > the n-th level can be encoded by a vector 

3n = (gn,l. • • • G {0, 1, • • . , 

and the whole tree can be identified with the sequence of levels 

oo 

(<7i,52,...)GX=n{0,l,...,i3"~'}°", 

n = l 

so that the space T of all plane trees (finite or infinite) with branching 
bounded by D can be identified with a subset of X. 

Theorem 5 There is a unique measure Poo on T such that it is consistent 
with measures P„: 

Poo — ! 

where 7r„ denotes the root's neighbourhood of height n of a tree from T. 
This measure is concentrated on Soo- 

Proof: The first statement follows from Theorem [3] and the Consistency 
theorem. The second statement is a consequence of the fact that for each 
n £ N, Pn is concentrated on trees with positive number of vertices at 
n-th level. □ 

The space X is compact in the product topology. Therefore, the con- 
vergence of finite-dimensional distributions established in Theorem [1] and 
the classical Prokhorov theorem (see e.g. Bil99,) imply the following re- 
sult: 

Theorem 6 As N — > oo, measures Pm viewed as measures on X converge 
weakly to Poo in the product topology. 

This statement shows that there is a limiting object for the random trees 
that we consider. This object is an infinite random tree. For any n £ N, 
the first n levels of this random tree are distributed according to P„. 

Let us now embed the space X into X — (Z!J.)^+ filling up all the unused 
coordinates with zeros. The measure Poo can be treated as a measure on 
X thus generating a -valued process (X„)55!Lo with discrete time. This 
process along with the associated random tree is visualized on Figure [S] 
For any n, the map X„ describes how the n-th level of the tree is built 
upon the n — 1-th one. 

For a level g : N — >■ Z+, we denote by \g\ the number of non-zero entries 
in g (i.e. the number of vertices at the level). For two levels g and g' we 
write g < g' if max; g[ < \g\. If g <\ g' then we define 

fli 
1=1 

the energy induced by level g' at its parent level g. 

Theorem |4] immediately implies the following result: 

Theorem 7 The process (Xn) defined above is Markov with transition 
probability 

10, otherwise. 
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6 A limit theorem for the size of n-th 
level 

Let us introduce Y„ — \X„\, the random number of vertices at n-th level. 
The following statement is a direct consequence of Theorem |4] or Theo- 
rem [T] 

Theorem 8 The process {Yn)^=o Markov with transition probability 

nr-ir 7 1 ^' k'~k k -l3(Ei, +... + Ei, ) 

P{Yn+i = fc I F„ = fc} = —p a e ' ^ 'ki , 

0<n,...,ifc<i3 

il+...+ifc=fe' 

The next theorem shows that in fact Y„ grows linearly in time. Let 

M = Sa - 1, (18) 

where 

D 

Then jj, > being the variance of p* , a nondegenerate distribution. 
Theorem 9 

Yn 2 Law Ti 

^ r, 

n fi 

where T is a random variable with density 



p{t) = 



te'\ t>0 
0, t<0 
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Proof: Let us find the Laplace transform (generating function) of tlie 
distribution of Y„ : 

L„{s) = Ee^^", s < 0, 

(this definition differs from the traditional one by a sign change of the 
argument) and prove that for any a: < 0, 

lim L„ (-) = — =: L^{x), (19) 

n->oo \nJ (i _ my 

the r.h.s. being the Laplace transform of 

At _ 21 
p{t) = — e A" , 

the density of the r.v. ^F.This will imply the desired result, see e.g. |Kal86[ 
Appendix 5] for various statements on Laplace transforms. 
Theorem |8] and (|16p imply 

fe' 0<ii,...,i^<D 
il + ...+ik=k' 

y (n + ... + ife)p'i+-+''=e^('i+-+''='e-''(^'i+-+^''=) 
kp" ^ — ' 

' 0<ii ,...,ifc<-D 

= 'm(s')v(a')^~^ , 
where 

i = i = 

and 

D D 

w{s) — iple'"' = C'ip^e~^^^e"' — v'{s). 

i=0 1=0 

Therefore, 

L„+i(s) = Ew{s)v{sf-'' = !i^Egin.(.)y„ ^ ^(s)L„(/(s)), (20) 

u(s) 

where 

f{s)=lnvis), (21) 

and 

Both z and / are analytic functions. An elementary calculation shows 
that 

f(s) = s+!^s'+r{s), 

and 

ln2(s) ^ fis + g(s), 
where p, = w'{0) ~ 1 was introduced in (I18|) and 

|r(s)|<c|.^ |g(s)|<c|s|^ (22) 
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for some c > and all s < 0. 

From now on, a; < is fixed. Using (|20|l and the obvious identity 



Lo{s) = e'' 



we can write 
L 

where 



fix)^fof...of{x), k>0, 



so that we have to study the numbers (xn,k)nGm,k=o,...,n defined by 

x„,k = f' (9- 

We shaU compare (a;„,fc)„gN,fe=o,...,n to (j/„,fc)„gN,fc=o,...,n defined by 



Un.k — n _ 

X 2 

For fixed n, both sequences (xn^k) and (yn,k) are negative and increas- 
ing in k. Therefore 

\Xn,k\ < \x„fi\ = — , 



and 



\yn,k \ < \yn,o\ = 



Let us prove that for sufficiently large n and any k between and n, 



\yn,k — Xn^k\ < k [ — + c] ( 



(23) 



This is certainly true for fc = 0. For the induction step, we write 

Ij/n.fc - X„^k\ < \yn,k - f{yn,k-l)\ + 1/(^,1, fc-l) - /(Sn,fc-l)l = /l + h- 

A straightforward computation based on H21|) shows that 



|/i| = 



2 

M 2 , . 

— yn,k-iyn,k + r(y„.k-i, 



Since |/'(s)| < 1 for all sufficiently small s, we see that 

1-^2] < lyn,fe-l — Xn,k^l\- 

Combining these estimates we see that 



\yn,k - Xn,k\ < ( — + C 



+ \yn,k-l — Xn,k-l\ 



and our claim (|23p follows. It immediately implies that 

C 



\yn,k ~ Xn,k\ < 



(24) 
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for some K = K{x), sufRciently large n and all k. We can now write 

n-1 

InLn — j = ^ ln2:(a;„_fc) + a;n,n 



n — 1 n — 1 

= ^ ^Xn,*! + ^ q{Xn,k) + 2;n,n 

n—1 n—1 n—1 

= ^ A*?/n,fc + ^ M(3;n,fe - t/n,fc) + ^ q{x„,k) + Xn,„ 
fe=0 ft=0 fe=0 

= /l + /2 + /s + /4. 

It is straightforward to see that lim„_>oo I2 + I3 + I4 = 0. The first term 

n-1 ^ ^ n-1 ^ 

fe = ^ 2 ft=o 2 n 

can be viewed as a Riemann integral sum, so that 



lim InLn (-) = f^x f '^^ = -2 In (l 
which immediately implies (|19p . □ 



7 A functional limit theorem 

In this section we prove the following theorem on diffusion approximation 
for the process Y: 

Theorem 10 Let 

Z4t) = ^^, n€n,t€R+. 
n 

Then, as n ^ 00, the distribution of Zn converges weakly in the Sko- 
rokhod topology in D[0, cx)) to the unique nonnegative weak solution Z of 
the stochastic ltd equation 

dZ{t) = ^dt + y/jj.Z{t)dWit), 
Z(0) = 0. 

Proof: Since the initial point Z{0) = is an "entrance and non-exit" 
singular point for the positive semi-axis (see the classification of singular 
points in |IM74] ), the existence and uniqueness of a nonnegative solution 
for all positive times is guaranteed. Let us define 

b{x) = n, and a{x) — fi ■ maxja;, 0}, x £ R, 

and extend the equation above to the negative semi-axis by 

dZ{t) = b{Z(t))dt + ^Ja{Z{t))dW{t). (25) 
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An obvious argument shows that there is no solution starting at and be- 
ing negative for some t > 0. Therefore the weak existence and uniqueness 
in law hold for (|25|) . According to Section 5.4B of |KS98j . this existence 
and uniqueness is equivalent to the well-posedness of the martingale prob- 
lem associated with b and a. 

We will use Theorem 4.1 from [EK861 Chapter 7] on diffusion approx- 
imation. The coefficients a, b were defined on the whole real line so as the 
theorem applies directly, with no modification. We proceed to check its 
conditions. 

We must find processes An and B„ with the following properties: 

1. Trajectories of A„ and Bn are in D[0, oo). 

2. An is nondecreasing. 

3. AIn = Zn — Bn aud Mn — An are martingales with respect to the 
natural filtration generated by Zn , An , B 

n • 

4. For every T > the following holds true: 



lim Esup \Zn{t) - Zn{t~)\ 



lim E sup \ An{t) — Anit- 
lim E sup \Bn{t) ~ Bn{t~ 



sup 

t<T 



Bn{t) 



b{Zn{s))ds 



= sup \Bn{t) — ^lt\ 
t<T 



sup 

t<T 



We shall need the following lemma: 
Lemma 1 



(26) 
(27) 
(28) 



(29) 



An{t) - / a{Zn{s))ds 0, n — )■ oo. 



(30) 



EK+i|y, = fe] 

ElYj'+.lYj = k] =1 

e[y;'+,\yj = k] =1 



l)Bl 



+ k, 

--Bs + 3(fc - l)B2 + {k- l)(fc - 2), 
--Bi + 4{k - l)Bs + 6{k ~ l)(fc - 2)B2 + 3(A; 
+ (fc-l)(fc-2)(fc-3), 
E[y/+i]yj = k\ =B5 + 5(fc - 1)B4 + 10(fc - l)(fc - 2)Bi + 10(/c - 1)53-82 
+ 15(fc - l)(fc - 2)bI + 10(fc - l)(fc - 2)(fc - 3)52 
+ (fc- l)(fc-2)(fe-3)(fc-4). 
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Proof: For the first of these identities, we write 

fc-1 



0<ii,...,ifc<D 
D 



ii =0 



D 



12=0 



+ k{k-l) ^ iiCp'^e-"^'! 
\n=o 

= i(fcB2 + fc(fc - 1)) = Ba + fc - 1 

= M + fcj 

where we used the symmetry of the terms + . . .'i 
ik-iik and (fT6)) . Next, 



1112 + ilia + . . . + 



(ii + . . . + jfc)^p' 



3 il+...+ifc -/3(Bii+...+i5i^) 



0<ii,...,ii.<D 



= - {kB3+3k{k - l)B2 + k{k - l)(fc- 2)) 

= 53 + 3(fc - 1)B2 + (fc - l)(fc - 2), 

and the other two identities in the statement of the lemma can be obtained 
in a similar way. □ 

Returning to the proof of the functional limit theorem, let us find the 
coefficient Bn(t) first. The process Zn is constant on any interval of the 
form [j/n, (j + Due to Lemma [T] 



1 



= Z„{t) + p— , 



(31) 



so that we can set Bn{t) = ^,[nt]/n to satisfy the martingale requirement 
on Mn = Zn — Bn- Notice that with this choice of relations (|28|) 
and (|29p are easily seen to be satisfied. Lemma [T] also implies 



Zn{t+-] Zn{t) 



Bs + 3{nZn{t) - l)B'2 + {nZn{t) - l)(nZn(t) - 2) 



(32) 



SO that for t G — Z, 

n. ' 



Zn{t) 



= -pZ„{t) + ^(Ba - B| - B2 + 1) 



Therefore we can set 
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to satisfy the martingale requirement on — A. Notice that An is 
nondecreasing since 



-^iZn{t) + ^(53 - Ba - 52 + 1) > ^{B2 - 1) + ^(Bs - Ba - B2 + 1) 

> ^(B3-B|) >0, 

where the last inequality follows from the Cauchy — Schwartz inequahty 
and Bi = 1. So properties 1-3 are satisfied, and (|30|) follows from the 
definitions of a and A, and the convergence 

lim sup y -!-(B2 -l) + -^{Bs^ Bl - B2 + 1) = 0. 
To prove (|27p we use the definition of A to write 



EsUp|A„(t) -^n(t-)l < -E sup ^'n ( ^ ) + ^ (B3 - -83 - -B2 + 1 



n I 



so it suffices to prove that Esupi^j,Z„ (^) is bounded. The definition 
of M„, Lyapunov's inequality and Doob's maximal inequality for sub- 
martingales imply that for some c > 0: 



E sup Z„ - <nT + E sup 



J-<T 



Mn I - 

n 



<iJiT + cv/2(E^2(r) + M'r2). 

Lemma [1] implies that EZ^{T) has a limit, as n oo, so that (|27[) is 
verified. 

A lengthy but elementary calculation based on Lemma [T] shows that 
E[(y,+i - Y,f\Yn] < c{Yf + 1) 



for some constant c > 0, so that we can write 



Esup[Zn(t) - Zn(t-)Y < 

t<T 



Esnp{Znit) ~ Zn{t~)y 



1/2 



1/2 



j<nT 



1/2 



Since Lemma [T] implies that for some constant ci > 0, 



E(K," + 1) < ci/, jeN, 
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we conclude that 



Esup{Z„{t) - Zr^(t-)f < J-j-n- cm'^T^ 0, n ^ oo, 



and the proof of the theorem is complete. 



□ 



8 Diffusion limit for finer structure of the 
random tree 

In this section we present a non-rigorous and sketchy description for the 
diffusion limit of the infinite Markov random tree itself rather then its 
width given by at time n. Let us fix any time ng and divide all Y^^ 
vertices into r nonempty disjoint groups. For any n > no denote the 
progeny of i-th group at time n by Vi^n. 

We want to study the coevolution of (Vi,„, . . . , Vr,n)- Though each Vi^n 
is not a Markov process, it is elementary to see that the whole vector is 
a homogeneous Markov process. We would like to compute the diffusion 
limit for this vector under an appropriate rescaling; 

We need to find the local drift and diffusion coefficients for the limiting 
process. Let ji + . . . + jV — k. Then computations similiar to Lemma [1] 
produce 



Vl,m + 1 jl I 1 T/ ^ 1 A \ 
—\yi,[nt], ■■■ , yr,[nt]) = — Ul, ■ ■■,Jr) 



so, by symmetry, the local limit drift is 



. Jl/"- 1 
k/n n' 



bi{v) = fi- 



Vl + . . . + Vr 

Similarly, the diagonal terms for local diffusion: 



Vl,m + 1 jl 



n n I \ n 



.Jl ^ ^ k — 



— ) j — (^l,[nt], • • ■ , K-,[ntl) = — O'l, • • • , jr 



and 

aii{v) = ^.Vi. 

For the off-diagonal terms a computation produces 

E[(Vl,m + l - jl){V2,m + l - J2)| Vl,m = jl , . . . , K,m = jm] = 0, 

so that 

ttij =0, i / j. 
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So, the limiting equations are 



Let us introduce cumulative counts 

Uj =Vi + ... + Vj. 

Then Z{t) = Ur{t), and 

U, I 

d(/j = p^jj-dt + VMt^il{i7i>o}rfWi + • • • 



... + v/RtWVOl{c/,-i/,-i>o}dM/. (33) 

Then, for each < iii < . . . < iir, we can solve this equation with initial 
data 

(f/l(to), ■ • • , Ur(to)) = {Ul,. . . , lir), 

which gives a random nondecreasing map 

$^$to-u^{U{t))t>to- (34) 

Here u runs through the set {ui, . . . ,Ur}. It is clear though that if we 
insert another point u between and u* = Ur, then solving the stochastic 
equation above for the modified set of initial points, we see that the the 
new solution map is a monotone extension of the old one. Adding points 
of a countable dense set one after another, we can extend the solution 
map onto u £ [0, u*]. So, for each m* > we are able to define a random 
monotone map $ : [0,u*] r1^"'°°\ 

Our last point is to represent these solution maps via stochastic inte- 
grals w.r.t. a Brownian sheet {W{x,t))t,x>o, i.e. a continuous Gaussian 
random field with zero mean and 

coy(W{xi,ti), W{x2,t2)) = (a;i A X2)(ti A t2), xi,X2,ti,t2 > 0. 

Equations (|33|) imply that ^{u, t), t > to, u £ [0, u*] is equal in law to 
the monotonone (in u) solution of the following SPDE: 

<E>(ii t) f 

d^{u,t) J .■, dt+ / l[o,^*(„,t)]VK(da; X dt), 

$(u,fo) =u, u e [0,11*]. 

A rigorous treatment of the limiting solution $, and a precise convergence 
statement will appear elsewhere. 
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